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Abstract 

The slope of the best fit line from minimizing the sum of the squared oblique errors is the root of a 
polynomial of degree four. This geometric view of measurement errors is used to give insight into the 
performance of various slope estimators for the measurement error model including an adjusted fourth 
moment estimator introduced by Gillard and lies (2005) to remove the jump discontinuity in the estimator 
of Copas (1972). The polynomial of degree four is associated with a minimun deviation estimator. A 
simulation study compares these estimators showing improvement in bias and mean squared error. 
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1 Introduction 

With ordinary least squares OLS(y|a;) regression, we have data {(xi,Yi|AT = x±), (x n , Y n \X — x n )} 
and we minimize the sum of the squared vertical errors to find the best-fit line y — h(x) = /3 + ft\X. 
With OLS(?/|:e) it is assumed that the independent or causal variable is measured without error. The 
measurement error model has wide interest with many applications. See for example Carroll et al. (2006) 
and Fuller (1987). The comparison of measurements by two analytical methods in clinical chemistry is 
often based on regression analysis. There is no causal or independent variable in this type of analysis. The 
most frequently used method to determine any systematic difference between two analytical methods is 
OLS(y\x) which has several shortcomings when both measurement sets are subject to error. Linnet(1993) 
states that "it is rare that one of the (measurement) methods is without error." Linnet(1999) further states 
that " A systematic difference between two (measurement) methods is identified if the estimated intercept 
differs significantly from zero (constant difference) or if the slope deviates significantly from 1 (proportional 
difference)." Our paper concentrates on how to determine whether or not there is a significant difference 
between two measurement instruments using a Monte Carlo simulation; that is, we concentrate our studies 
about a true slope of 1. As in the regression procedure of Deming (1943), to account for both sets of errors, 
we determine a fit so that both the squared vertical and the squared horizontal errors will be minimized. The 
squared vertical errors are the squared distances from (x,y) to (x,h(x)) and the squared horizontal errors 
are the squared distances from (x,y) to (h~ 1 (y) 1 y). As a compromise, we will consider oblique errors. All 
of the estimated regression models we consider (including the geometric mean and perpendicular methods) 
are contained in the parametrization (with < A < 1) of the line from (x, h(x)) to (h~ 1 (y),y). 
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We review the Oblique Error Method in Section 2. In Section 3, we review the geometric mean and 
perpendicular error models. In Section 4, we show how the geometric mean slope is a natural estimator for 
the slope in the measurement error (error-in-variables) model. Section 5 shows a relationship between the 
maximum likelihood estimator in the measurement error model and the geometric mean estimator. We give 
a case study to illustrate the effects that erroneous assumptions for the ratio of variance of errors can have 
on the maximum likelihood estimators. Section 6 discusses a fourth moment estimator and shows a circular 
relationship to the maximum likelihood estimator. Section 7 develops a minimum deviation estimator derived 
by minimizing Equation (|2} in Section 2 with respect to A for fixed Section 8 contains our Monte Carlo 
simulations where we compare these estimators. Supporting Maple worksheets are available from the link 
http://people.virginia.edu/~der/QDriscoll_Ramirez/| 



2 Minimizing Squared Oblique Errors 

^From the data point (xt, y{) to the fitted line y = h{x) = Pq+^x the vertical length is eij = \yi — /3 — /3^Xi\ , 
the horizontal length is bi = \xi — (t/j — Pq)//3i\ = \{P\Xi — Hi + Po)/^i\ = \ a i/Pi\ and the perpendicular 

length is hi = dj/y 1 + /3?. With standard notation, S xx = X«=i( x « _ x) 2 \ $yy = Yli=i(Vi ~ H) 2 '■> S xy = 

J27=i( x i ~ z){Vi - v) witn the correlation p = S xy /^S xx S yy . 

For the oblique length from (xi,y%) to (h~ 1 (yi) + X(xi — h~ 1 (yi)),yi + X(h(xi) — yi)), the horizontal error is 
(1 — X)bi = (1 — A)aj/ and the vertical error is Adj. The sum of squared horizontal, respectively vertical, 
errors are given by SSEh(/3 , /3 1: A) = a f) I Pi an ^ SSE v (0 o , /3 X , A) = ^"=1 a 1- ^ n a comprehensive 

paper by Riggs et al. (1978), the authors place great emphasis on the importance of equations being 
dimensionally correct, since it is from these equations that the slope estimators are derived. In particular 
the authors state that: "It is a poor method indeed whose results depend upon the particular units chosen 
for measuring the variables ... and that invariance under linear transformations is equivalent to requiring 
the method to be dimensionally correct." So that our equation is dimensionally correct we consider 

SSE o (P , fr, A) = (1 - A) 2 ^ + A 2 ^ (1) 

where {a 2 , a 2 .} are Madansky's moment estimators of the variance in the horizontal, respectively vertical, 
directions. In Section 4, we show that this is equivalent to using 

SSE o {p ,Pi,\) = (l-\) 2 S yy SSE h + \ 2 S xx SSE v . (2) 

Similar to that shown in O'Driscoll, Ramirez and Schmitz (2008), the solution of dSSE o /d/3 = is given 
by /3q = y — P{x and hence 

SSE o {P ,P 1} X) = ((l-X) 2 Syy/p 2 1 +X 2 S XX ) {Syy-Z^Sxy + plS^). 

The solutions of dSSE a /df3 1 = are then the roots of the fourth degree polynomial equation in f3 1 , namely 



PaVi) = AV f^f^ - X 2 ^p?\ + (1 - A) 2 ^ - (1 - A) 2 ,/f^ = 0. (3) 
V yy yy yy V ^xx 

With A = 1 we recover the minimum squared vertical errors with estimated slope /?5' er , and with A = we 
recover the minimum squared horizontal errors with estimated slope p\ or . 

For each fixed A € [0.1], there corresponds $ x € [f3\ er , (3\ or ] which satisfies Equation (j3]), and conversely, 
for each fixed Pi £ [(3\ er , /3 J ,or ] , there corresponds A £ [0.1] such that minimizing the sum of the squared 
oblique errors has estimated slope P x . We measure the angle 6\ of the oblique projection associated with A 
using the line segments (x, y) to (x, h(x)) and (x, h(x)) to (h^ 1 (y),y). When the slope f3 1 is close to one, for 
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A near one we anticipate 9\ to be near 45° and for A is close to zero we anticipate 9\ to be near 135°. The 
angles are computed from the Law of Cosines. 

A similar argument to that of O'Driscoll et al. (2008) shows that Pa{Pi) has exactly two real roots, 
one positive and one negative with the global minimum being the positive (respectively negative) root 
corresponding to the sign of S xy . Riggs et al. (1978) in Equation (119) also noted the role of the roots of a 
similar quartic equation in determining the slope estimators. 



3 Minimizing Squared Perpendicular and Squared Geometric Mean 
Errors 

The perpendicular error model dates back to Adcock (1878) who introduced it as a procedure for fitting a 
straight line model to data with error measured in both the x and y directions. For squared perpendicular 
errors Adcock minimized SSE per (f3 Ql /3 1 ) = a ?/(l + P\) w ^h solutions (3q £T —y — ji\ er x and 



per (^yy S xx ) i y {Syy S xx ) 2 + 4:S xy ^ ^ 



2S x y 



provided S xy ^ 0. However, in this case, the equation which minimizes SSE per (fi , is dimensionally 
incorrect unless x and y are measured in the same units. 

For squared geometric mean errors, wc minimize SSE gm (/3 , /3 1 ) = J2i=i (vT^M) = J2i=i a l 
with solutions /Jq" 1 =y — f3 9 ™x and 

Pi" = ±\f Syy/ Sxx- (5) 
Proposition 1. The geometric mean estimator has oblique parameter A = 1/2. 



Proof: For /3f m = \/S yy /S xx ,we solve the quadratic equation P4(/3f m ) = for A. This equation reduces 
to a linear equation whose root is A = 1/2. □ 



4 Measurement Error Model and Second Moment Estimation 

We now consider the measurement error (errors-in-variables) model as follows. In this paper it is assumed 
that X and Y are random variables with respective finite variances o 2 x and dy, finite fourth moments and 
have the linear functional relationship Y = (3 Q + f3 1 X. The observed data {(xi,yi), 1 < i < n} are subject 
to error by Xi = X; + Si and yi = Yi + n where it is also assumed that 5 is N (0, a 2 ) and r is N(0, a 2 .). In 
our simulation studies we will use an exponential distribution for X. 

It is well known, in a measurement error model, that the expected value for j3\ er is attenuated to zero 
by the attenuating factor o\l{p\ + o~ 2 x ), called the reliability ratio by Fuller (1987). Similarly the expected 
value for fi ] ^ >r is amplified to infinity by the amplifying factor {a\ + a 2 ) Thus for the measurement 
error model, when both the vertical and horizontal models are reasonable, a compromise estimator such as 
the geometric mean estimator /3f m is hoped to have improved efficiency. However, Lindley and El-Sayyad 
(1968) proved that the expected value of /3f m is biased unless a 2 /ay = u 2 ja 2 x . 

Madansky's moment estimators for {a 2 , a 2 } are 





Q 

^xy 


n 




Syy 


fi\Sxy 


n 


n 



(6) 
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If as is known or can be approximated, Madansky used the first of the equations in Equation ([5]) to derive an 
estimator for P 1 but Riggs et al. (1978) in Figure 5 produced an example for which this estimator performs 
poorly. In general as is not known and concentration focuses on estimating the true error ratio a 2 . /a 2 . Our 
Monte Carlo simulation illustrates how poor estimates for the error ratio may lead to large biases using the 
MLE estimator. It would be interesting to determine if there is a slope estimator j3 l that is a fixed point for 
f3 1 — a T {f3 l )/as{Pi). This can be achieved with the geometric mean estimator /3? . 

Proposition 2. /3f m is a fixed point of the ratio function j3 l — ~\^}\ . 

Proof: Rewrite na 2 = S xx - S X yl3{ m = S xx - S xy ^f S xx / S yy and na T = S yy - S xy j3\ m = S yy - 
S xy y/Syy/S xx , from which a T /a s = ^S yy /S xx .U 

We return to the assertion made in Section 1. A natural standardized weighed average for the oblique 
model is shown in Equation ([T]) and using the fixed point solution of Proposition 2 in this equation yields 
the equivalent model given in Equation @. 



5 The Maximum Likelihood Estimator 

If the ratio of the error variances k = a 2 /a 2 is assumed finite, then Madansky (1959), among others, showed 
that the maximum likelihood estimator for the slope is 



pmie _ (Syy K S XX ) + y (S yy kS xx ) 2 + 4:np 2 S xx S yy . 

It also follows that if k — 1 in Equation J7| then the MLE (often called the Deming Regression estimator) 
is equivalent to the perpendicular estimator, /3^ er . Conversely, if the MLE is /3f er then k = 1. In the particular 
case where S xx = S yy then /3^ er has a A value of 0.5. We note that S yy /S xx is a good estimator of a y j a 2 , 
but in general, it is not a good estimator of the error ratio k — a 2 ./ cr|. In Section 6, we discuss a moment 
estimator k for n. 

The MLE is a function of {p, k, S xx /S yy }. Table 1 gives the corresponding /3" l ' e value for typical values. 
For fixed {k, p], the values for ji™ le in each column of Table 1 decrease. As expected, with k = 1 and 
S xx = Syy, the maximum likelihood estimator agrees with the geometric mean estimator, both being equal 
to 1.00. 



Table 1 

Values for /3™ ie for typical {p, k, S xx /S yy } 



K = 




0.500 


0.500 


0.500 


0.500 


1.000 


1.000 


1.000 


1.000 


2.000 


2.000 


2.000 


2.000 


P = 




0.200 


0.400 


0.600 


0.800 


0.200 


0.400 


0.600 


0.800 


0.200 


0.400 


0.600 


0.800 


Sxx i Syy 


= 1/2 


5.396 


2.828 


2.016 


1.632 


3.799 


2.219 


1.750 


1.535 


1.414 


1.414 


1.414 


1.414 


Sxx/ Syy 


= 1 


2.686 


1.569 


1.237 


1.086 


1.000 


1.000 


1.000 


1.000 


0.372 


0.638 


0.808 


0.921 


Sxx/Syy 


= 2 


0.707 


0.707 


0.707 


0.707 


0.263 


0.451 


0.571 


0.651 


0.185 


0.354 


0.496 


0.613 



In Table 2 we record the corresponding obliqueness parameter A for the maximum likelihood model for 
these typical values. Small values near support OLS(x\y), denoted by /3i° r , and large values near 1 support 
OLS(y|x), denoted by /3" er . For fixed {k, p}, the values for the obliqueness parameter A in each column of 
Table 2 increase indicating the model moves from p\ or towards f3\ er . With « = S yy /S xx , /3™ le = /3f™ as 
shown by the cells of Table 2 with A = 0.500. 
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Table 2 

Values for A for typical {p, k, S xx /S yy } 



K = 




0.500 


0.500 


0.500 


0.500 


1.000 


1.000 


1.000 


1.000 


2.000 


2.000 


2.000 


2.000 


P = 




0.200 


0.400 


0.600 


0.800 


0.200 


0.400 


0.600 


0.800 


0.200 


0.400 


0.600 


0.800 


Sxx/Syy - 


,1/2 


0.033 


0.111 


0.197 


0.273 


0.089 


0.223 


0.316 


0.375 


0.500 


0.500 


0.500 


0.500 


Sxx/Syy 


= 1 


0.089 


0.223 


0.316 


0.375 


0.500 


0.500 


0.500 


0.500 


0.911 


0.777 


0.684 


0.625 


Sxx/Syy 


= 2 


0.500 


0.500 


0.500 


0.500 


0.911 


0.776 


0.684 


0.625 


0.967 


0.889 


0.803 


0.727 



The Madansky's moment estimators {a g ,a T } depend on the choice of j3 1 . In Table 3, we record the effect 
of varying slopes on the moments and their ratio when computable. 

Table 3 



Error ratios for Madansky's moment estimators for varying f3 1 





~2 


~2 

°T 




over 

Pi 

phor 




n ° xx 


1-p 2 o 
n °VV 




oo 



/3f m 


1-p o 

n ° xx 


1-p c 

n °yy 




/3f r 


1 3 xx -\-Syy -\j{S xx S yy) 2 -\-4iP 2 S xx Syy 


1 S xx ~\~Syy -\J \S XX Syy)^ -\~4lp^ S xx Syy 


1 


2 n 


2 n 


jjrale 




1 f^S XX -\- Syy \J S X X Syy )^ ~\- 4 K S X X S ' y y 


K 


2 n 


2 n 



In the next section, we introduce a second moment estimator for k and a fourth moment estimator for 

6 Fourth Moment Estimation 

When n is unknown, Solari (1969) showed that the maximum likelihood estimator for the slope p\ does 
not exist, as the maximum likelihood surface has a saddle point at the critical value. Earlier Lindlcy and 
El-Sayyad (1968) suggested, in this case, that the maximum likelihood method fails as the estimator would 
be the geometric mean estimator which converges to the wrong value. Sprent (1970) pointed out the result 
of Solari does not imply that the maximum likelihood principle has failed, but rather that the likelihood 
surface has no maximum value at the critical value. 

Copas (1972) offered some advice for using the maximum likelihood method. He assumed the data has 
rounding-off errors in the observations which allows for an approximated likelihood function to be used, and 
that this approximated likelihood function is bounded. His estimator for the slope has the rule 

RCOp _( pT if 

11 "i (tr * ' 

so the ordinary least squares estimators are used depending on the whether |/3f m | < 1 or |/3f m | > 1. 

The Copas estimator is not continuous in the data as a small change in data can switch the direction 
of the inequality £y| < £a;| which will cause a jump discontinuity in the estimator /3™ p . To achieve 
continuity in the data, we adjust the range of the fourth moment estimator /?™ om described in Gillard and 
lies (2005) to account for admissible values for {ct^,ct^}. See also Gillard and lies (2010). 
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The basic second moment estimators for a\ and a 2 T are shown in Equation ([5]). Since variances must be 
positive, we have the admissible range for the moment estimator for fa as 

&xx ^xy 

Set S xxxy = Y)(xj — x) 3 (yi — y) and similarly for S xyy y Following Gillard and lies (2005), from Equations 
(22) and (24)), the fourth moment equations of interest are 

Sxxxy 7;— q~2~2 / n N 

= Pfi 4 + 3pa a s (9) 

n 

Sxyyy ~S 3 ~ qo~2~2 

= a « 4 + 6p<7 a, 

n 

with /i 4 denoting the fourth central moment for the underlying distribution of X. The four equations from 
([6|) and ((9]) allow for a moment solution for fa as 



Sxyyy <$S X ySyy 



h = \ 7 _ o g • ( 10 ) 

y ^xxxy 3o X yOxx 

In our simulation study, was well-defined around 99% of the time. If the radicand is negative, we 
recommend using the geometric mean estimator. 
To satisfy Equation © we define fa lom as 

fa er if Pi < PT r 

/3 X W if 0i > /3'i° r 

This is a Copas-type estimator with the moment estimator used to "smooth" out the jump discontinuity 
inherent in the Copas estimator. We next study the circular relationship between this moment estimator 
and the maximum likelihood estimator with fixed k. 

We will define the moment estimator as a function of fa, then use this value to compute /3™ ie («;) 

as a function of k. Finally, we note that fa^ le '(k(/3 \)) = fa, showing the circular relationship between the 
estimators {fa, (3™ le }. Thus our moment estimator also has the functional form of the maximum likelihood 
estimator_with fixed k. 

Set k(Pi) — a T /a$ so 

n{fa) = S vv-PiP\/ S^S vv_ . (12) 

S X X Pj fa \J SxxSyy 

We use K,(fa) in Equation ^ to determine [3™ le x )) . As fa -t p\ or the numerator in Equation (TTJ]) tends 
to zero so K(fi x ) -t and /3" 1 ' 6 (k(/3 1 )) p\ or ; similarly as fa -t fa er the denominator in Equation (fT2j) 
tends to zero so K>{fa) — > oo and j3™ le (K,(fa)) — > fa er . A stronger result is given in the following Proposition. 

Proposition 3. For each fa, fa 1 e (h[fa)) = /3 1 and in particular e (K(/3i)) = fix- 

Proof: In Equation © solve P™ le {n) = fa for n = no, and then check that kq is the same as in Equation 

An example helps to demonstrate the smoothing achieved with the moment estimator /3™ om . Assume 
{p = 0.5, S xx = I, Sxxxy = 10, Sxyyy = 5}. Equation (0) requires that 0.13029 < S yy < 1.31862. As 
Syy varies over the admissible values for S yy , H{fa) varies over [0, oo] and fa varies over [fa er , Pi° r ] and 
/3™ e (/t(/5i)) = fa, a surprising result. 
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Table 4 

Slope Estimates with {p = 0.5, S xx = 1, S xxxy = 10, S xyyy = 5} 



Syy 




Pi 








0.1303 


0.1805 


0.7219 


0.7219 


0.0000 


0.7219 


0.2000 


0.2236 


0.7222 


0.8944 


0.0558 


0.7222 


0.4000 


0.3164 


0.7145 


1.2649 


0.3123 


0.7145 


0.6000 


0.3873 


0.6977 


1.5492 


0.7412 


0.6977 


0.8000 


0.4472 


0.6734 


1.7889 


1.4850 


0.6734 


1.0000 


0.5000 


0.6417 


2.0000 


3.0760 


0.6417 


1.2000 


0.5477 


0.6020 


2.1909 


9.6582 


0.6020 


1.3186 


0.5742 


0.5742 


2.2966 


oo 


0.5741 



7 Minimum Deviation Estimation 

^From Section 1 with fixed f3 1 the solution of dSSE /d\ — is given by A = S yy / '(S yy + fi 2 S xx ). Substituting 
pmom £ Qr ^ - n t ^- s resu j t £ or ^ produces a Minimum Deviation type estimator which we denote by /3™ d , 
with P"f r < (3™ ld < (i\ or . Our simulation studies will support the efficiency of this Minimum Deviation 
estimator. 

Riggs et al. (1978) observes for S yy /S xx = 1 that the geometric mean estimator is unbiased when k = 1, 
negatively biased when n < 1, and positively biased when k > 1. The authors also state that "no one method 
of estimating j3 1 is the best method under all circumstances." To determine the efficiency of these estimators 
we conduct a Monte Carlo simulation in the next section. 

8 Monte Carlo Simulation 

Our Monte Carlo simulation uses X with an exponential distribution with mean [i x = 10 (and ax = 10) 
and Y — X so j3 l = 1 and /3 = 0. Both X and Y are subject to errors a 2 s , respectively a 2 where (aj, a 2 ) 
G {1,4, 9} x{ 1,4, 9}. The sample size n is chosen as 100. 

The first simulation, with the number of replications R = 100, summarized in Table 5, reports on the 
bias in the MLE estimator in using a misspecified value of k. For (crjj, erj) e {1,4, 9} x{ 1,4, 9}, k ranges 
with ratios from 1 : 1 to 9 : 1 The true error ratios of k are recorded in the first row and the assumed error 
ratios k # which are used to compute /3™ ie are recorded in the first column, both in ascending order. 

Table 5 



Percentage Bias of MLE estimator for the assumed ratios k # for varying values of n = a\ /a 2 s 











{01=1,00 


= 0,n 


= 100, R 


= 100} 








{k* 


,k} 


1 : 9 


1 : 4 


4 : 9 


1 : 1 


4 : 4 


9 : 9 


9 : 4 


4 : 1 


9 : 1 


1 : 


9 


0.166 


0.502 


2.164 


0.870 


3.663 


7.995 


8.723 


3.592 


9.282 


1 : 


4 


-0.914 


-0.012 


0.811 


0.666 


2.807 


6.087 


7.351 


3.067 


8.265 


4 : 


9 


-2.066 


-0.564 


-0.643 


0.445 


1.878 


3.999 


5.838 


2.496 


7.137 


1 : 


1 


-4.067 


-1.541 


-3.184 


0.051 


0.218 


0.266 


3.083 


1.467 


5.058 


4 : 


4 


-4.067 


-1.541 


-3.184 


0.051 


0.218 


0.266 


3.083 


1.467 


5.058 


9 : 


9 


-4.067 


-1.541 


-3.184 


0.051 


0.218 


0.266 


3.083 


1.467 


5.058 


9 : 


4 


-5.957 


-2.495 


-5.590 - 


-0.342 


-1.417 


-3.330 


0.338 


0.437 


2.936 


4 : 


1 


-6.956 


-3.016 


-6.856 - 


-0.561 


-2.310 


-5.230 


-1.161 


-0.136 


1.748 


9 : 


1 


-7.840 


-3.489 


-7.973 - 


-0.763 


-3.119 


-6.899 


-2.513 


-0.663 


0.657 
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As expected, the values for k = k show the smallest bias, and in each column for a given k the bias 
shows that the estimated slope moves from over estimating the true value to under estimating the true value 
of P 1 — 1. This was anticipated since for k near zero the maximum likelihood estimator favors j3\ or which 
over estimates p\. and correspondingly, for k near one the maximum likelihood estimator favors P\ er which 
under estimates $. 

We conducted a second large scale Monte Carlo simulation study with R = 1000 to demonstrate the im- 
provement in the adjusted fourth moment estimator fi™ om over the Copas estimator which has a jump 
discontinuity. Simulations for other slope estimators have been reported by Hussin (2004). We used 
an exponential distribution for X with [i x = 10, and set fi x = 1 and p\ = 0. The values for the er- 
ror standard deviations were (as, a T ) € {1,2, 3,4}x{l,2, 3,4}, the sample size was n = 100 and the 
number of replications R = 1000. We report in Tables 6, 7, and 8 the MSE and the Bias for the es- 
timators {/3r r ,/3f r ,/3f r ,/3f m ,/3™ om ,/3i a<i } for {*s, °t) € {(1, 2), (1, 3), (1, 4)}. Similar results hold for 
(as, a T ) G {(2, 1), (3, 1), (4, 1)} Note that in each case the adjusted fourth moment estimator /3™ om is 
more efficient than the Copas estimator. To see this we compare the pairs of values (MSE, Bias) in the 
three tables. For /3" iom these are {(1.001, -0.830), (2.786, -1.807), (5.717, -2.813)} and for Copas these 
are {(2.378,-2.410), (8.769, -7.347), (23.018,-13.848)}. In practice, the researcher may not know which of 
{a\, aj} is larger. If he does, then he may choose either of {(3\ er , /3i° r } with fi\ or favored when a 2 s is much big- 
ger than a"\. A fairer comparison is to use OLS(j/|x) and OLS(x|y) each 50% of the time. Thus in the Tables 
we report the average for the MSE and the average of the absolute deviation of the biases for the two OLS es- 
timators. These average (MSE, Bias) values from the tables are {(1.336, 2.518), (4.847, 4.831), (12.46, 7.858)} 
showing the improved efficiency of /3™ om . As anticipated the minimum deviation estimator /3™ achieves fur- 
ther improvement in reduction of (MSE, Bias) with values {(0.646, -1.336), (2.309, -3.584), (5.578, -6.288)}. 

Table 6 

X is Exp(10),/3 1 = 1,/3 Q =0,R= 1000, n = 100 

(cr T = l,a-,5 = 2) 

OLS* reports average MSE and average absolute Bias for {/?i 6r , /?i° r } 





MSE* 10~ a 


%Bias 


A 


Ox 


over 
Pi 


2.001 


-3.843 


1.000 


46.12 


OLS* 


1.336 


2.518 


NA 


NA 


nhor 
Pi 


0.670 


1.193 


0.000 


136.12 


pper 


0.688 


-1.396 


0.507 


89.99 


P{ m 


0.653 


-1.360 


0.500 


90.78 


pmom 


1.001 


-0.830 


0.339 


108.27 


p\ op 


2.378 


-2.410 


0.651 


74.47 




0.646 


-1.336 


0.497 


91.06 



Table 7 

X is Exp(10),p 1 = l,p = Q,R= 1000, n = 100 
(a T = I, a s = 3) 

OLS* reports average MSE and average absolute Bias for {/3\ er , (3\ or } 





MSE * 10~ ;5 


%Bias 


A 


9x 


over 
Pi 


8.370 


-8.459 


1.000 


47.53 


OLS* 


4.847 


4.831 


NA 


NA 


phor 
Pi 


1.324 


1.203 


0.000 


137.53 




2.688 


-3.954 


0.520 


89.60 




2.423 


-3.760 


0.500 


92.19 


^mom 


2.786 


-1.807 


0.318 


110.94 


f?° p 


8.769 


-7.347 


0.848 


58.14 


pmd 
Pi 


2.309 


-3.584 


0.490 


93.196 
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Table 8 

X is Exp{lQ),l3 1 = 1,/3 = 0,R= 1000, n = 100 
(cr r = I, a s = 4) 

OLS* reports average MSE and average absolute Bias for {f3\ er , f3 l l° r } 





MSE * 10-' d 


%Bias 


A 


Ox 




22.791 


-14.376 


1.000 


49.43 


OLS* 


12.46 


7.858 


NA 


NA 




2.134 


1.339 


0.000 


139.43 


/3f r 


7.406 


-7.480 


0.539 


89.95 


Pi 


6.242 


-6.880 


0.500 


94.08 


nmom 
Pi 


5.717 


-2.813 


0.286 


114.51 




23.018 


-13.848 


0.950 


52.71 




5.578 


-6.288 


0.480 


96.04 



9 Summary 

We have modified the fourth moment estimator of the slope from Gillard and lies (2005) to show how to 
remove the jump discontinuity in the estimator given by Copas (1972). We show how the moment estimators 
l^mom^ ~2| can ^ e used to determine an MLE estimator which surprisingly is the original moment 
estimator of the slope. Our simulations support our claim that both {/3™ om , are more efficient than 
the average of the OLS estimators. 
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